Detecting rumours with latency guarantees using massive streaming data
نویسندگان
چکیده
Today’s social networks continuously generate massive streams of data, which provide a valuable starting point for the detection rumours as soon they start to propagate. However, rumour faces tight latency bounds, cannot be met by contemporary algorithms, given sheer volume high-velocity streaming data emitted networks. Hence, in this paper, we argue best-effort that detects most quickly rather than all with high delay. To end, combine techniques efficient, graph-based matching patterns effective load shedding discards some input while minimising loss accuracy. Experiments large-scale real-world datasets illustrate robustness our approach terms runtime performance and accuracy under diverse conditions.
منابع مشابه
Overload Management in Data Stream Processing Systems with Latency Guarantees
Stream processing systems are becoming increasingly important to analyse real-time data generated by modern applications such as online social networks. Their main characteristic is to produce a continuous stream of fresh results as new data are being generated at real-time. Resource provisioning of stream processing systems is difficult due to time-varying workload data that induce unknown res...
متن کاملApproximated Bayesian Inference for Massive Streaming Data
Extracting meaningful information out of massive streaming data is a significant challenge due to the high dimensionality of the inference problem and limits on available computational power and memory. While Bayesian models often convey significant inferential advantages, standard computational algorithms relying on Markov chain Monte Carlo are infeasible to apply. This motivates online variat...
متن کاملTowards Detecting Rumours in Social Media
The spread of false rumours during emergencies can jeopardise the well-being of citizens as they are monitoring the stream of news from social media to stay abreast of the latest updates. In this paper, we describe the methodology we have developed within the PHEME project for the collection and sampling of conversational threads, as well as the tool we have developed to facilitate the annotati...
متن کاملComputational Graph Analytics for Massive Streaming Data
Handling the constant stream of data from health care, security, business, and social network applications requires new algorithms and data structures. We present a new approach for parallel massive analysis of streaming, temporal, graph-structured data. For this purpose we examine data structure and algorithm trade-offs that extract the parallelism necessary for high-performance updating analy...
متن کاملStreaming Algorithms for Distributed, Massive Data Sets
Massive data sets are increasingly important in a wide range of applications, including observational sciences, product marketing, and monitoring and operations of large systems. In network operations, raw data typically arrive in streams, and decisions must be made by algorithms that make one pass over each stream, throw much of the raw data away, and produce \synopses" or \sketches" for furth...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Vldb Journal
سال: 2022
ISSN: ['0949-877X', '1066-8888']
DOI: https://doi.org/10.1007/s00778-022-00750-4